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A  widely  used  class  of  binary  trees  is  studied  in  order  to  provide  information  useful  in 
evaluating  algorithms  based  on  this  storage  structure.  A  closed  form  counting  formula  for  the 
number  of  binary'  trees  with  n  nodes  and  height  k  is  developed  and  restated  as  a  recursion 
more  useful  computationally.  A  generating  function  for  the  number  of  nodes  given  height  is 
developed  and  used  to  find  the  asymptotic  distribution  of  binary  trees.  An  asymptotic 
probability  distribution  for  height  given  the  number  of  nodes  is  derived  based  on  equally  likely 
binary  trees.  This  is  compared  with  a  similar  result  for  general  trees. 

Random  binary  trees  (those  resulting  from  a  binary  tree  sorting  algorithm  applied  to 
random  strings  of  symbols)  are  counted  in  terms  of  the  mapping  of  permutations  of  n  symbols 
to  binary  trees  of  height  k.  An  explicit  formula  for  this  number  is  given  with  an  equivalent 
recursive  definition  for  computational  use.  A  generating  function  is  derived  for  the  number  of 
symbols  given  height.  Lower  and  upper  bounds  on  random  binary  tree  height  are  developed 
and  shown  to  approach  one  another  asymptotically  as  a  function  of  n,  providing  a  limiting 
expression  for  the  expected  height. 

The  random  binary  trees  are  examined  further  to  provide  expressions  for  the  expectations  of 
the  number  of  vacancies  at  each  level,  the  distribution  of  vacancies  over  all  levels,  the 
comparisons  required  for  insertion  of  a  new  random  symbol,  the  fraction  of  nodes  occupied  at 
a  particular  level,  the  number  of  leaves,  the  number  of  single  vacancies  at  each  level,  and  the 
number  of  twin  vacancies  at  each  level.  A  random  process  is  defined  for  the  number  of 
symbols  required  to  grow  a  tree  exceeding  any  given  height. 

Finally,  an  appendix  is  given  with  sample  tabulations  and  figures  of  the  distributions. 


1.  Introduction.  A  binary  tree  is  a  finite  set  of  nodes,  either  empty  or  containing 
one  node  called  a  root,  such  that  all  other  nodes  are  partitioned  into  disjoint  sets  which 
are  respectively  called  left  and  right  subtrees  of  the  root.  The  subtrees  also  satisfy  the 
definition  of  a  binary  tree.  Thus,  a  binary  tree  is  an  unlabelled  rooted  arborescence 
with  successors  of  at  most  degree  two  distinguished  only  as  left  and  right. 

Figure  1.1  shows  a  binary  tree  with  six  nodes.  The  root  node  is  shown  at  the  top  and 
is  connected  by  arcs  to  two  immediate  successor  nodes  which  are  the  roots  of  its  left 
and  right  subtrees.  Each  node  with  no  successors  (for  instance,  the  left  subtree  of  the 
root)  is  called  a  leaf.  The  level  of  a  node  indicates  how  deep  it  is  within  the  tree.  Thus 
the  root  has  level  one,  its  immediate  successor  nodes  have  level  two,  and  so  forth  down 
the  occupied  portions  of  the  subtrees.  The  height  of  a  binary  tree  is  the  largest 


FlOLRfc  1.1.  A  binary  tree  with  six  nodes. 
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(.icon  pied  level.  A  full  binary  tree  has  no  internal  vacancies  (unoccupied  node  posi¬ 
tions). 

Binary  trees  are  frequently  used  as  information  storage  structures  on  digital  comput¬ 
ers.  For  instance,  one  of  the  most  popular  methods  of  randomly  retrieving  information 
by  a  key,  or  symbol,  is  to  store  the  key  data  in  a  binary  tree.  To  search  for  a  particular 
symbol,  we  begin  by  looking  at  the  root  and  proceed  by  applying  the  following  rules 
recursively: 

(1)  If  the  symbol  matches  the  root  symbol,  the  symbol  is  found. 

(2)  If  the  sy  mbol  is  “less  than”  the  root  (according  to  some  binary  ordering  relation) 
continue  the  search  by  considering  the  left  successor  of  the  root  as  the  new  root  (of  the 
left  subtree). 

(3)  If  the  symbol  is  greater  than  the  root,  continue  by  searching  the  right  subtree. 

(4)  If  there  is  no  root,  the  symbol  is  not  in  the  binary  tree. 

We  assume  for  simplicity  that  all  symbols  are  distinct  with  respect  to  the  ordering 
relation.  Otherwise,  the  ordering  relation  and  search  must  be  modified  in  an  obvious 
fashion.  The  construction  of  a  binary  tree  for  use  by  such  a  search  scheme  may  be 
performed  by  sequentially  examining  the  key  symbols  to  be  inserted.  This  binary  tree 
sort  is  a  one-pass  ordering  procedure  which  proceeds: 

(1)  If  there  is  no  root,  insert  the  symbol  as  the  root. 

(2)  If  the  symbol  is  less  than  the  root  symbol,  continue  by  considering  the  left 
subtree. 

(3)  If  the  symbol  is  greater  than  the  root  symbol,  continue  with  the  right  subtree. 

As  an  example,  consider  the  six  symbols  ABCDEF  and  a  lexicographical  binary 

ordering  relation.  Suppose  that  the  particular  permutation  of  symbols  examined  in 
BDAFCE.  The  resulting  binary  tree  is  shown  in  Figure  1.2,  and  has  structure  identical 
to  the  tree  in  Figure  1.1.  Note  that  this  same  tree  may  have  resulted  from  other 
permutations  of  the  same  symbols,  for  instance  BADCFE.  Therefore,  there  is  a 
many-to-one  mapping  of  key  symbol  permutations  to  corresponding  binary  trees. 

The  height  of  the  binary  tree  in  Figure  1.2  is  four  and  thus  the  maximum  number  of 
comparisons  required  to  insert  another  symbol  is  four.  Similarly,  if  this  tree  is  used  for 
retrieving  symbols,  the  maximum  search  length  for  a  symbol  in  the  tree  is  four,  and  for 
a  symbol  not  found  the  maximum  is  five. 

A  computer  implementation  of  a  binary  tree  storage  structure  requires  that  each 
node  be  represented  by  its  key  symbol  accompanied  by  sufficient  additional  informa¬ 
tion  to  identify  and  access  the  left  and  right  subtrees.  This  is  usually  accomplished  by 
use  of  a  dense  array  of  node  symbols  each  with  left  and  right  pointers,  by  node  storage 
via  address  calculation  into  an  array  space  sufficient  to  store  all  possible  binary  trees 
with  a  given  number  of  nodes  and  some  maximum  height,  or  by  some  similar  method. 

In  the  following  sections  we  study  this  widely  used  class  of  binary  trees  in  order  to 
provide  information  useful  in  examining  algorithms  based  on  this  storage  structure.  A 
closed  form  counting  formula  for  the  number  of  binary  trees  with  n  nodes  and  height  k 
is  developed  and  restated  as  a  recursion  more  useful  computationally.  A  generating 
function  for  the  number  of  nodes  given  height  is  developed  and  used  to  find  the 
asymptotic  distribution  of  binary  trees.  An  asymptotic  probability  distribution  for 
height  given  the  number  of  nodes  is  derived  based  on  equally  likely  binary  trees.  This 
is  compared  with  a  similar  result  for  general  trees. 


t 
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FiouRi  1.2.  A  binary  tree  with  inserted  symbols. 


KANLWJM  lil!V\K’r  IKht.S 


4"' 


Random  binary  trees  (those  resulting  from  the  binary  tree  sorting  algorithm  applied 
to  random  strings  of  symbols)  are  counted  in  terms  of  the  mapping  of  permutations  of 
n  symbols  to  binary  trees  of  height  k.  An  explicit  formula  for  this  number  is  given  with 
an  equivalent  recursive  definition  for  computational  use.  A  generating  function  is 
derived  for  the  number  of  symbols  given  height.  Lower  and  upper  bounds  on  random 
binary  tree  height  are  developed  and  shown  to  approach  one  another  asymptotically  as 
a  function  of  «,  providing  a  limiting  expression  for  the  expected  height. 

The  random  binary  trees  are  examined  further  to  provide  expressions  for  the 
expectations  of  the  number  of  vacancies  at  each  level,  the  distribution  of  vacancies 
over  all  levels,  the  comparisons  required  for  insertion  of  a  new  random  symbol,  the 
fraction  of  nodes  occupied  at  a  particular  level,  the  number  of  leaves,  the  number  of 
single  vacancies  at  each  level,  and  the  number  of  twin  vacancies  at  each  level.  A 
random  process  is  defined  for  the  number  of  symbols  required  to  grow  a  tree 
exceeding  any  given  height. 

Finally,  an  appendix  is  given  with  sample  tabulations  and  figures  of  the  distribu¬ 
tions. 

The  distributional  results  yield  a  hypothesis  test  for  randomness.  Such  tests  are  of 
particular  interest  in  areas  such  as  computer  science  and  cryptoanalysis,  where 
transformations  which  introduce  nonrandomness  must  be  efficiently  screened  to 
identify  the  transformation  groups  associated  with  such  behavior. 

The  class  of  random  trees  which  we  investigate  is  quite  distinct  from  the  widely- 
studied  equally -likely  trees  common  in  the  literature.  We  have  never  encountered  an 
application  leading  to  this  latter  class  of  trees,  and  suspect  that  its  popularity  is 
founded  at  least  in  part  on  the  relative  ease  of  analysis. 

The  mathematical  tools  brought  to  bear  on  this  problem  (especially  regarding  the 
asymptotic  behavior  of  sequences  of  functions^  are  not  commonly  used  in  the 
operations  research  literature,  but  are  recommended  to  those  interested  in  distribu¬ 
tional  as  well  as  worst-case  complexity  arguments. 

2.  Number  of  binary  trees  of  a  given  height.  In  this  section  we  consider  the 
problem  of  finding  the  number  of  binary  trees  with  n  nodes  and  height  k.  Denote  this 
number  by  t(n,k),  where  n  and  k  are  positive  integers.  Since  t(n.k)  =  0  unless 


k<n<  2k-\  (2.1) 

we  are  only  concerned  with  integers  n  -  1,2,  . . .  ;  k  =  1,2,  . .  .  satisfying  the  inequal¬ 
ity  (2.1). 

An  explicit  formula  for  the  numbers  t(n,k)  can  be  obtained  by  the  following  simple 
combinatorial  argument.  Consider  the  class  of  all  binary  trees  with  n  nodes  and  height 

k  which  have  exactly  m]  nodes  at  the  level  j  +  1 ,  j  =  I . k  —  1.  Let  m,  =  l,  +  r-r 

where  /;  and  rj  are  numbers  of  nodes  which  are  left  successors  and  right  successors  of 
nodes  at  the  level  j.  In  other  words.  /•  is  the  number  of  nodes  at  level  j  +  1  at  the  end 
of  left  going  arcs  emanating  from  nodes  at  level  j.  These  can  be  selected  in  ')  ways, 
and  the  ry  nodes  in  C'y')  ways.  Thus,  the  total  number  of  ways  to  arrange  the  arcs 
between  nodes  at  levels  j  and  j  +  1  is  given  by 

2 

/,  >  0  r,  >  0 
(,  +  r,  -  m, 

Since  m0  =  1  (the  root)  and  m,  +  •  •  •  +  mk  =  n  -  1  with  mi  >  1  for  /  1 . k  we 
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obtain  from  (2.2)  the  formula 


t(n,k) 


where  the  summation  is  over  all  integers  mjyj  =  1 . k  satisfying 

mt  >  1,  j  =  1,  . . . ,  k  -  1,  and  +  •  •  •  +  mk„l  =  n  —  I. 


(2.3) 


The  formula  is  valid  for  n  >  1  and  k  satisfying  (2.1),  for  n  =  1  we  have  trivially 

4(1,1)=  1- 

Although  (2.3)  is  an  explicit  formula  for  the  number  t(n,k)  it  is  not  very  convenient 
for  calculation.  An  alternate  way  is  through  a  recurrence.  Let 

t(n,k)  —  T(n,k)  —  T(n,k  —  1),  n>  1,  k  >  1,  (2.4) 


where  T(rt,k)  is  the  number  of  binary  trees  with  n  nodes  and  height  not  exceeding  k.  If 
we  define 

77/1,0)  =M  ![  w  =  °’  (2.5) 

v  ’  1 0  if  n  >  0,  ' 

and  T (0,  k)  -  1  for  k  >  0,  we  obtain  the  recurrence  relation 

T(n  +  l,k+  1)  =  2  T(j,k)T(n-j,k),  (2.6) 

7-o 

valid  for  n  >  0  and  k  >  0.  This  follows  from  the  fact  that  the  class  of  all  binary  trees 
with  n  +  1  nodes  and  height  not  exceeding  k  +  1  can  be  partitioned  into  n  +  1 
subclasses  according  to  the  number  of  nodes  j  in  the  left  subtree  of  root.  Since  the 
heights  of  both  the  left  and  right  subtrees  must  not  exceed  k  the  number  of  trees  in  the 
yth  class  is  the  product  T(j,k)T(n  ~j,k),  and  (2.6)  follows. 

Note  that  with  the  convention  (2.5)  the  recurrence  (2.6)  yields  automatically  T(n,k ) 
=  0  for  n  >  2k  -  1.  and  that  for  0  <  n  <  k,  T(n,k )  is  just  the  number  of  binary  trees 
with  n  nodes.  It  is  well  known  (see  Knuth  [8])  that  the  latter  are  Catalan  numbers 

i(2")-  "*0’  <2-7> 

so  that 

T(n,k)  -  C„  for  0  <  n  <  k.  (2.8) 

From  the  recurrence  (2.6)  one  easily  obtains  the  sequence  enumerators  defined  by 

fk(x)  =  2  T{n,k)x”,  k>  0.  (2.9) 

n  >  0 

Since  the  right-hand  side  of  (2.6)  is  the  Cauchy  product,  we  have  immediately 

2  T(n  +  \,k  +  1  )xn=//(x), 

n  >  0 

from  which  in  view  of  (2.5)  we  obtain 

fk  +  \{-<)  =  1  +  xf^(x),  k>  0, 
with  /„(*)=  1. 

Note  that /*(.*)  is  a  polynomial  in  x.  hence  if  x  is  regarded  as  a  complex  variable  fk, 
k  =  0. 1 . is  a  sequence  of  entire  functions. 


(2.10) 
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We  now  show  that  this  sequence  converges  uniformly  in  a  circular  region  of  the 
complex  plane,  specifically  that  as  A  -»  x 


4(z)~>«(z) 

<*>(*)  =  2 

n  —  0 


uniformly  for 


<  i ,  where 
** 


(2.11) 

(2.12) 


To  see  this,  note  that  with  C„  as  in  (2.7)  we  have  T(n ,  k )  <  C„  for  all  k  which  together 
with  (2.8)  yields 

»(z)-fk(z)  =  zk'2(Cn-T(n.k))z'-k, 

n  >  k 

so  that  for  jzj  <  \ 


W-) -/*(*)!<  2  c„W<  2  C,4  ". 

n  A  n  ■■  A 

which  is  a  tail  of  the  expansion  w(t)  =  2. 

This  result  will  now  be  used  to  develop  an  asymptotic  distribution  of  the  numbers 
t{n,k)  as  A  -*  x.  In  doing  this,  we  follow  the  method  of  Reny i  and  Szekeres  used  in  [9] 
for  a  similar  problem. 

From  (2.4)  we  have  for  A  >  1,  t(0,k)  =  0, 


2  t(n’k)zn=fk(z)-fk (2-13) 

n  >  0 

which  are  entire  functions  for  every  A  >  1.  Hence  by  the  Cauchy  formula 


1  r  fk(z)  ~  fk  - \(z) 


,n+  1 


dz. 


(2.14) 


where  we  take  the  circle  \z\  =  \  as  the  contour  of  integration.  To  estimate  the  integral 
we  use  Laplace’s  method  by  first  showing  that  as  A  ->  x  the  only  significant  contribu¬ 
tion  of  the  integrand  is  in  the  vicinity  of  positive  real  axis. 

For  z  —  \e‘*’  the  function  u(z)  defined  by  (2.12)  becomes 


=  2e  ‘^1  —  Vl  e'* —7r<<jy<7r.  (2-15) 


Its  modulus  2|1  -  Vl  —  e,<f  j  is  a  decreasing  function  of  |«J>|  on  0  <  |<>|  <  m  with 
maximum  2  at  <j>  =  0  and  minimum  2(y2  -  1  )  at  |<>!  =  m. 

Now  use  (2.10)  to  write 

/*+ i(-)  -  "(z)  =  z[fk(z)  ~  "(*)][/*(*)  +  «(*)]  (2-16) 


and  choose  an  arbitrary  0  <  e  <  2/3.  By  (2.1 1)  there  exists  K(  such  that 

A  >  K(=*\fk(z)  -  w(z)|  <  e  for  all  <>  (2. 17) 

whence  using  (2.16) 

!/*+■(*)  “  w(z)l  <  «(2|«(i*''*)l  +  c) 

<  4ejl  -  Vl  -  e*  |f  1  + - - - ) 

1  A  2\a>(Le‘*)\  I 

<  12«|1  -  Vl  “  e*  since  \u(±  e*)|  >§  .  (2.18) 
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Iterating  (2.16)  we  thus  find  that 

_ _  m 

l//c,  +  m(z)  ~  w(2)l  <  (l2c|  1  Vl  e'*  I)  (2.19) 

for  all  m  =  1,2,  ..  .  and  all  tp,  i.e.,  that 

Mi?")  -  “(JO  +  o(|l  (2.20) 

uniformly  in  <p. 

Since  for  sufficiently  small  |<>|  we  have 

ii-vi-^i<  i-ivsr  (2.2i) 

we  can  choose  for  instance  a  sequence  <pk  =  (In 2k/k)2  for  which  as  k-*  oo 

sup  1 1  -  -  e*  \<e~^k  (2.22) 

w>i  >  o* 

and  thus  conclude  that  for  |$|  >  <pk 

+  (2-23) 

as  k—>cc. 

If  we  now  substitute  (2.23)  into  (2.14)  we  have  with  z  = 

K  A(  i-**)  -  A- . 

ein+d<p+4no{e-'"lk),  (2.24) 

as  k  -»  oc . 

To  estimate  the  remaining  integral  we  first  set  <f>  =  0  and  call 

«»=/*(*)•  (2-25) 

The  recurrence  (2.10)  yields 

«*+i  =  l+W.  «o=1  (2-26) 

from  which  by  substitution  yk-2  -  ak  we  obtain  the  recurrence 

Y*+I  =  y '*-$?*>  Yo=l-  (2.27) 

Thus  yk  is  the  &th  iterate  of  the  function  h(x)  =  x  -  \x2,  and  iterates  of  such 
functions  can  be  handled  by  standard  methods  (see,  e.g.,  de  Bruijn  [4,  §8.7  or  Exercise 
8.11]).  We  obtain  the  asymptotic  expansion 


(2.28) 


as  k  x  where  c  is  a  constant. 

To  obtain  the  expansion  for  |4>[  <  (ln2k/ k)2  we  use  the  method  of  variable  coeffi- 
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cients  on  (2.28).  See  Szekeres  [12].  Put 

fk(\e*)  =  2  ~  \  £i(&  )  +  )  “  fx  &(**  )  +  •  •  •  (hiSher  order  terms)- 

(2.29) 

where  £/  - 

For  4>  fixed  we  then  have  |^  +  1  =  |A.  +  |a/A  so  that  upon  substitution  and  using  the 
expansion 

_  ( /■  i  1  _  _  i/-\  i  £  ,  r\  i 


*i($ +  i^) =  + 1 +  " '  • 

&(*  +  £  f)  =  &($)  +  |  gl  (i)  +  •  ■  • . 


i  =  i  l  =  1  _  _L  +  J_ 

A  +  1  A  i  +  A  /c2  A;3 


- ! — _  =  -L - 1 - -  =  4r  -  \  +  •  ■  •  ,  and  ln(  A  +  1 )  =  In  A  +  - 

(*+i)2  *2(1  +  a-)2  *2  *3  K  }  k 

we  obtain  after  collecting  terms 

/*+.(!**)  -  2  - 1  *,(£)  ~T  S2(«) +  II  [  *■(£)  -  -  %(£>] 


[2«!(£)  -  *;(£>]  + 


(2.30) 


On  the  other  hand  from  (2.10)  fk  +  i(\e‘'t>)  =  1  +  ^  e‘*f£(\e'*),  whence  using  (2.29)  and 
e‘*  =  eit/kr  =  i  +  (|/A)2  +  -  •  • ,  we  have 


i(  y°)  -  2  -  J  hit)  +  t®r  *2(«)  +  p  [  J  sf(«)  + « 2  -  c«j(£) 
-  5  «.(£)«!(£)  +  ■  ■  ■  ■ 


(2.31) 


Comparing  terms  of  (2.30)  and  (2.3 1 )  we  obtain 

gift  -  m)  -  i  «?(€)  +  £2-  2&(i)  ~  \  *.(*)&(*)• 

with  initial  conditions  g,(0)  =  g2(0)  =  4  from  (2.28).  These  equations  have  a  solution 
£,(|)  =  2£cot±|,  g2(|)  =  £2(sin4£)'2, 

which,  when  substituted  back  into  (2.29),  gives 

/»(!«*) -2- f  «cotii+  !sf  £2(sin{{)-2+  0(k--), 


where  ( 2  -  i&2.  |* |  <  [  ^  )  .  (2.32) 


/*(*«*)  -  f&colj*,  +  taf  )  : 

ln(A  —  1)  2  . 

"  +  0(*  ■’)• 


(*-lf 
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and  setting  so  that  .  x/(k  -  1)  =  £/ k ,  and  using  the  expan¬ 

sions 


coti(£-i{)  =  cotI{+ij(smi{) 


ln(A'  -  1)  =  In  A-  -  ^  +  •  •  • ,  and 


(s4(J4s))  J=(sinH  ‘('+i«coti{+..-), 

we  obtain 

-/*-,(  i«*)  -  ^€-(sin  ij)"2-  M  ( W  I  £(sin  I «)  " " 

+  j>(sini()  2-^icoli{(si4{)  2+°(<rJ>- 

However,  for  £2  =  /<J>A:2,  |</>|  <  (ln2A:/A:)2,  we  have 

£cot||  =  0(ln2/r),  |sin|£j_2-»0  as  k->  oo, 

so  that  we  can  write 


AUO -/*-■(!**)- 


i  e 

k2  sin2 


(2.33) 


The  same  expression  can  also  be  obtained  by  again  substituting  (2.32)  into  the 
recurrence  (2.10).  If  we  now  insert  (2.33)  into  the  integral  in  (2.24)  and  use  the 
substitution  £2  —  i<f>k2.  (2.24)  becomes 


£3 


sin2i! 


,  -  n(2//rt 


di+  4 


(2.34) 


where  the  path  of  integration  T  as  determined  from  the  condition  |<£j  <  (In 2k/k)2  is 
(see  Figure  2.2): 


£=T(  — 1  +  /) 
£  =  r(l  +  /) 


for - —  ln2&  <  r  <  0, 

V2 

for  0  <  r  <  —  In 2k. 

V2 


FlUl  RE  2.2 
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The  integrand  is  a  meromorphic  function  with  poles  at 

im  =  2irm,  m-±  1,±2,  ....  (2.35) 

and  corresponding  residues 


<■(£,)-« 


3(2t77m)2  —  2-~  (Inm)4 
k~ 


,  —  n(  )  •’  i  k 


(2.36) 


Before  applying  the  residue  theorem,  we  have  to  close  the  path  of  integration,  for 
instance  by  the  arc  (see  Figure  2.2)  A:  i-pke',f>,  pk  -  ln2&,  -w/4  <  0  <  tt/4. 
Deforming  the  arc  in  the  neighborhood  of  the  real  axis  so  that  it  passes  approximately 
in  between  the  two  consecutive  poles  we  have  from  |sinz|2  =  sin2*  +  sinh2y 

[sin  a  '  2  <  1  +  tj,  Tj  >  0, 


along  the  arc  for  large  enough  k.  Hence 


L 


C 

sin2 


<  (1  +  ij)pkjj/2e  de 


<  (1  +  7 


■J.L1 


0+rD^k  ./e 


2  n 


0~e' 


) 


(1  +  7])-upkk2 

<—  2„  f2pl  =  0{\n*k)  as  k—>cc.  (2.37) 


Finally,  applying  the  residue  theorem  to  the  integral  over  the  contour  T  +  A  and  using 
(2.34),  (2.36)  and  (2.37)  we  have  the  asymptotic  expansion 


'(">*)  =  4n+1  y-4  2 

k  m  >  1 


2  ~  (2 wm)4  —  3(2irmf 


e  '  n(2rrm)‘  /k‘ 


+  4nO(e~in'k)  +  4"o|  ^  j  +  4"d|  ^  j. 


(2.38) 


From  here  we  can  get  the  distribution  of  heights  of  binary  trees  with  n  nodes  or 
equivalently  the  probability  that  a  randomly  selected  binary  tree  has  height  k  if  all 
binary  trees  with  n  nodes  are  considered  equally  likely. 

Calling  this  quantity  pn(k )  we  have 

t(n,k) 

Pn(k)  =  .  (2-39) 

where  C„  -  ^kt(n,k)  is  the  Catalan  number  (2.7).  Using  the  asymptotic  expansion  [7] 
Cn  =  4n/n2'2\frr  +  0(4nn~5/2)  we  obtain  for  large  k  and  n, 


(2.40) 


where  /3  -  n/k 2.  Note  that  if  k  <  «,  which  is  the  case  of  interest,  the  dominant  O-term 
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is  the  last  one.  Thus,  if  for  some  arbitrarily  small  S  >  0 

ni/$+s  <  k  <  n, 

then  as  n  ->  oc 


21  -<2™.)2/3 


Pn(k)~%^  P2  2J2^(2iTm)4-3(2wm)2]e 
To  verify  that  (2.42)  is  indeed  a  probability  distribution  note  that  as  n->  oc 

S>»(*)~8  (P22  \2/3{2Km)4-3(2TTm)2}e-<2™^dk, 

*  n  J  m-i L  J 


(2.41) 


(2.42) 


where  the  limits  of  summation  and  integration  are  given  by  (2.41).  Making  a  substitu¬ 
tion  k  —  \n / /3  we  have  as  n  ->  oc 

4vV  f  ( (2t™)4  ('°2py2e~l2*m)lfi  dp  -  (2tt/m)2  f  *3/3  l/2e_(2,”"):/}43  ). 
m=n  •'0  -*0  J 


But 


f  °°3/3 1/2e ^  +  (2^W)2  f*2j&3/V 

Jo  o  ^0 


(2t rm)2fi 


d^ 


and  we  are  left  with 


lim  4yV  ^  2(27 rm)2p2''2e  —  f00  p.2e~  ^  dj>.=  1 

by  the  substitution  p2  =  {2-nm)2p.  If  we  denote  £/„  as  the  random  variable  with 
distribution  pn(k)  and  call  the  asymptotic  distribution  function 


F(x)  =  lim  P 

'  n-n o 


(2.43) 


we  obtain  by  integrating  the  right-hand  side  of  (2.42)  over  0  <  k  <  2x{n  by  the  same 
method  as  above 


/"(x)  =  4jc“37t5'/2  2  m2e  (7rm/J£)2.  (2-44) 

m- ) 

It  can  easily  be  seen  from  (2.43)  that  for  x  -  O(yn),  F(x)-»0  and  thus  the 
asymptotic  distribution  (2.42)  or  equivalently  (2.43),  (2.44)  is  valid  for  all  k  in  the 
range  (2.1)  as  n~*  oc  (see  Appendix,  Figure  A.l).  This  is  identical  with  the  distribution 
function  obtained  by  Renyi  and  Szekeres  (see  [9,  p.  506])  for  the  height  of  general  (as 
opposed  to  binary)  trees.  Only  the  normalizing  factor  in  (3.43)  differs  by  a  constant, 
nameiy  y'2  .  More  precisely,  if  is  the  height  of  a  general  tree  with  n  nodes  (i.e.,  with 
no  restriction  on  the  number  of  successors  of  a  node)  then  for  large  n 

2$;,  (2.45) 

a  somewhat  surprising  result. 

3.  Binary  trees  generated  by  random  permutations.  As  described  in  the  introduc¬ 
tion  the  tree  insertion  algorithm  defines  a  map  which  assigns  to  every  permutation 
77*(  1 . n)  a  binary  tree  with  n  nodes.  If  all  permutations  of  integers  1 . n  are 
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considered  equally  likely  the  resulting  trees  are  referred  to  as  random.  Thus,  every 
numerical  quantity  defined  on  a  binary  tree  becomes  a  random  variable. 

Let  „  be  the  height  of  a  random  binary  tree  with  n  nodes,  i.e..  generated  by 
random  permutations  of  the  first  n  integers.  Clearly,  the  probability 

?{*"«  <*k)  =  ±B(n,k),  (3.1) 


w'here  B(n,k)  is  the  number  of  permutations  tt(1,  .  . .  ,  n)  mapped  into  trees  with 
height  not  exceeding  k  (see  Appendix,  Table  A.2). 

In  order  to  obtain  an  explicit  expression  for  (3.1)  we  first  need  a  suitable  indexing 
system  for  the  nodes.  A  natural  way  to  do  this  is  to  consider  first  a  full  binary  tree  and 
label  its  nodes  by  the  sequence  of  positive  integers  starting  from  the  root  and  labelling 
in  each  subsequent  level  from  left  to  right  (see  Figure.  3.1). 

Thus  nodes  at  level  j.j-  1,2, ... ,  from  left  to  right  have  labels 

2f~ 1  +  m,  m  =  0 . 2'-'.  (3.2) 


Note  that  the  left  and  right  successors  of  a  node  labelled  x  have  labels  2x  and  2x  +  1, 
respectively,  left  successors  always  have  even  labels,  right  successors  have  odd  labels 
(the  root  being  an  exception). 

Next  consider  a  fixed  binary  tree  with  n  nodes  and  for  every  label  x  of  the  form 
(3.2)  define 


0 

1  +  d(  2x)  +  d(2x  +  1) 


if  there  is  no  node  with  label  x, 
if  there  is  a  node  with  label  x. 


(3.3) 


Note  that  d(x)  is  simply  the  number  of  nodes  in  a  subtree  with  root  at  x,  in  particular 
d(\)  =  n  and  d(x)  -  1  if  and  only  if  the  node  labelled  x  has  no  successor,  i.e.,  is  a  leaf. 


Lemma.  There  is  a  one-to-one  correspondence  between  binary  trees  with  n  nodes  and 

height  not  exceeding  k  and  the  set  D(n,k)  of  vectors  (d(\),d(2) . d( 2k  -  1))  with 

nonnegative  integral  components  satisfying  the  conditions: 

(1) d(\)=n, 

(2)  for  all  j  =  1,  . . . ,  k 

2J~ 1  <  x  <  2>  -  1  =>0<  d(x)  <  2k "-'+1  -  1, 

(3)  d(2x)  +  d( 2x  +  1 )  >  0  =s>  d{x)  =  1  +  d(2x)  +  d(2x  +  1 ). 

Proof.  Given  a  binary  tree  the  numbers  d(x)  are  uniquely  defined  by  (3.3). 
Property  (1)  is  obvious,  property  (3)  follow's  from  (3.3)  since  nodes  2x  and  2x  +  1  are 
successors  of  node  x,  hence  if  there  is  a  node  with  labels  either  2x  or  2x  +  1  there 
must  be  one  with  label  x.  Property  (2)  is  necessary  since  if  it  were  violated  then  there 
would  be  a  subtree  with  root  at  level  j  having  at  least  2k~J*  1  nodes  and  the  height  of 
the  tree  would  then  exceed  k.  Conversely,  given  a  vector  (d(l),  .  .  .  ,  d(2k  —  1)) 
satisfying  ( 1 )— (3),  construct  first  a  complete  binary  tree  of  height  k  and  then  eliminate 
all  nodes  with  labels  x  such  that  d(x)  —  0.  Properties  (l)-(3)  then  guarantee  that  the 
result  is  a  binary  tree  with  exactly  n  nodes.  ■ 


1 


level  1 
level  2 
level  3 
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A  vector  (d(l), . . . .  d(2k  —  1))  E  D(n,k )  can  be  used  to  compute  the  number  of 
permutations  mapped  into  the  tree  corresponding  to  this  vector.  Consider  a  permuta¬ 
tion  77(1, ....  n)  =  (jj, ....  sn)  mapped  into  a  binary  tree  with  height  not  exceeding  k 
and  look  at  the  node  into  which  a  particular  symbol,  say  s„  is  mapped.  Let  xt-  be  the 
label  of  this  node  and  let  L(s.)  and  /?($,)  be  the  ordered  subsets  of  ($,,... ,  sn),  which 
are  mapped  into  the  left  and  right  subtrees  of  the  node  xt.  By  the  very  nature  of  the 
mapping  if  the  elements  of  L(s ,)  U  R(S;)  are  reshuffled  the  tree  is  not  changed  as  long 
as  the  order  of  elements  within  each  subset  L(s;)  and  R(st)  of  L(s,)  U  /?($,)  is 
preserved.  But  the  numbers  of  elements  in  L(st)  and  R(s,)  are  d( 2xt)  and  d( 2x,  +  1) 
respectively  so  that  the  number  of  permutations  resulting  from  such  a  reshuffle  is 

[d$x)  +  d(lx,+  l)]l  [rf(.y)-l]! 
d(2x)\d(2x:  +  1)!  d(lx^)\d(2xl  +  1)! 


using  (3.3)  and  d(x,)  >  0  (since  st  is  mapped  into  a  node  labelled  *,).  Repeating  this 
argument  for  each  node  of  the  tree  corresponding  to  (d(l), . .  .  ,  d(2k  —  1))  gives  the 
total  number  of  permutations 

TT  K*)-']' 

11  d(lx)'.d(2x  +  1)!  ' 

where  the  product  is  over  all  d(x)  >0,  x  =  1 . 2k  —  1.  From  here  using  the 

lemma  and  (3.1)  we  obtain  the  formula 


P(W„ 


< 


*)- 


n\ 


2  n 

d  (r  D(n,k) 


d(x)  -  1  ]! 
d(2x)\d(2x  +  1)! 


(3.4) 


Note  that  if  d'  =  (d'(\) . d'(2k  -  1))  is  obtained  from  d  =  (d(  1),  .  .  .  ,  d(2k  -  1))  by 


d'(x) 


d(x)  if  d(x)  >  0, 

1  if  d(x)  =  0, 


the  formula  (3.4)  takes  on  a  simpler  form 


P(JtrH<k)->  2 

de  D(n,k) 


2*  -  1  \  1 

n 


(3.5) 


(3.6) 


Unfortunately,  except  in  a  few  special  cases,  the  set  D(n,k )  is  quite  complicated  for 
(3.6)  to  be  useful  for  computation.  Again  a  recurrence  relation  may  be  preferable. 

Indeed,  such  a  recurrence  is  quite  easy  to  derive.  Consider  the  left  and  right  subtrees 
of  the  root  of  a  random  binary  tree  with  n  nodes.  If  J  is  a  random  number  of  nodes  in 
the  left  subtree  then  we  have 

•*"*+,  =  1  +  max {Wj  }.  (3.7) 

Now  J  is  simply  the  number  of  symbols  in  a  random  permutation  7r(l,  ...,«  +  1) 
=  (5,,  .  .  .  ,  jn+1),  which  are  less  than  Therefore,  J  is  uniformly  distributed  over 

10.  .  .  .  ,  n),  and  j  and  Wn.j  are  conditionally  independent  given  J.  Consequently 

P{-X'n+  l  -  1  <  k\J  =j)  =  <  k)P(  n-y  <  k), 

from  which  by  calling 


F(n,k)—  P{  '//„  <  k) 


(3.8) 
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and  taking  the  expectation  we  obtain  the  recurrence 

F(n  +  l,k  +  1)  =  — ]—r  2  F(j '\k)F(n  -  j.k)  ■ 
n  +  1  ■-() 

valid  for  n  J  0,  k  >  0 
if  we  define  F(0,k)  =  !  *  A  =  0, 

y  ’  1 0  if  k  >  0.  j 

It  may  be  noted  that  (3.9)  gives 

F(n,k)  =1  for  0  <  n  <  k ,  and  1 

F(n,k )  =  0  for  n  >  2k,  as  expected.  j 


(3.10) 


Now  (3.9)  can  be  used  to  calculate  the  numbers  F(n,k)  for  moderate  values  of  k  and  n 
(see  Appendix,  Tables  A.2  and  A.3,  and  (2));  however  the  memory  requirements 
increase  rapidly. 

It  can  also  be  used  together  with  (3.10)  for  some  special  choice  of  n  and  k.  For 
instance  for  n  close  to  2k  —  1  we  obtain 


’ 

k  ->  1, 

(3.11a) 

F(2k -2,k)  =  (2k  -  \)F(2k  - 

1,*), 

k>\, 

(3.11b) 

F(2k  -  3,  At)  =  (2*  -  1)(2*“'  - 

\)F(2k  —  l.it). 

k  >  2, 

(3.11c) 

and  a  few  more.  Note  that  (3.1  la)  is  the  probability  of  obtaining  a  full  tree  of  height  k. 
However,  these  few  terms  are  of  minor  interest. 

Another  possible  approach  is  to  use  generating  functions.  Defining 


/*(*)  =  2  F(k,n)xn,  k>  0, 

n-0 


(3.12) 


we  have  immediately  from  (3.9)  the  relation 

fk+ iW  =  1  +  (*fk(y)ty  M*)  «  1.  (3.13) 

Jo 

Note  that/^x)  are  again  polynomials,  fk(0)  =  1,  and  that  for  all  x  £  (0, 1] 
fk(x)<fk+i(x)^>(\ -xf'  as  k^>  oo. 

It  is  hoped  that  (3.13)  can  be  used  to  obtain  an  asymptotic  distribution  of  the  heights 
.  However,  we  have  not  been  successful  in  that  respect  to  date. 

It  has  been  suggested  to  us  by  A.  Washburn  that  a  lower  bound  on  the  expected 
height  E  }  can  be  easily  obtained  from  (3.7).  Taking  expectation  we  have 

=  1  +  £{max(;r,  (3.14) 

and  conditioning  upon  J 

E  { max(^r, ,  )  |  J  =  j }  >  max(£  { } .  £  ( W } ). 

Since  J  is  uniformly  distributed  over  (0,  ....  n)  this  implies 

£{max(^,^-/)}  >-~r  Sma  x(£  [^}  },£  (^  }).  (3.15) 

n  +  1  j= o 
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Hence  if  an,  n  =  0, 1,  .  . . .  is  a  sequence  of  numbers  defined  recursively  by 


1 

«„+i  =  1  +  ^jry  2  max(°ra"->  «o  =  °> 

it  is  easily  seen  from  (3.14)  and  (3.15)  that 

E  {  yfn  j  >  a„  for  all  n  =  0, 1, ...  . 

Note  that  an  is  a  strictly  increasing  sequence  so  that 


2  max(a,,a„_y)  =  - 
0 


2  ^  a;.  for  n  odd, 

n/2  <j<  n 

a„,2  +  2  2  aj  for  n 

n/2<j< n 


even. 


Since  clearly  a„  =  O(ri)  we  have  as  n^>  x 

a/2  <j**n 

which  upon  approximating  the  sum  by  an  integral  yields 

a(/)~l+2f  a(tv)dy. 

J 1/2 

On  the  other  hand,  let  Y  be  a  random  variable  defined  by 

\j  if 

{  n-J  if  Wj  <  . 


Then 


and  by  (3.7) 


nY  <  max{  J.n  -  J  ), 


F{<r„+l}  =  i  +  f(r„K}. 


(3.16) 


(3.17) 


(3.18) 


(3.19) 

(3.20) 


Now  for  large  n,  Y  is  uniformly  distributed  over  ( j,  1),  and  since  ju(n)  =  E  ( }  is  an 
increasing  function  of  n  we  get  from  (3.20)  by  conditioning  on  Y  and  applying  (3.19) 
the  asymptotic  inequality 


H(n)  <1  +  2 J  (i(nv)dy 


(3.21) 


valid  as  n-+  oo.  But  this  together  with  (3.17)  and  (3.18)  indicates  that  we  should  have 
in  fact 


fi(n)~a(n )  as  n->  cc 


(3.22) 


where  a(t)  is  a  solution  of  (3.18).  It  is  easily  verified  that  (3.18)  has  a  solution 


«(0  = 


lnr 

1  -ln2 


+  a. 


where  a  is  an  arbitrary  constant.  But  as  n(n)->  x  the  constant  can  simply  be 
disregarded  and  we  have  an  asymptotic  equivalence 


/i(«)— (1  —  ln2)  'inn  =  3.25889 In n 


(3.23) 


as  n~>  oc  (see  Appendix,  Tables  A.2  and  A. 3). 
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Since  completion  of  this  work,  Robson  [10]  has  shown  that  asymptotically  3.63 
<  fi(n)/\nn  <  4.31. 

4.  Further  results  for  random  binary  trees.  We  now  turn  our  attention  to  some 
other  quantities  associated  with  random  binary  trees.  In  particular  we  wish  to  study 
the  process  of  growing  the  tree  as  the  number  of  nodes  increases.  A  convenient  way  to 
visualize  such  a  process  is  to  consider  a  sequence  Xl,X2, ...  of  independent  random 
variables  with  a  common  continuous  distribution  and  take  as  our  permutation  the 
ranks  of  A',, .  .  . ,  Xn  for  each  n=  1,2,  .. .  .  Clearly,  for  each  fixed  n  all  resulting 
permutations  are  equally  likely,  and  each  new  symbol,  i.e.,  rank  of  Xn+ ,,  is  uniformly 
distributed  over  {1, . . . ,  n  +  1). 

If  for  some  n  we  have  a  random  binary  tree  then  a  new  symbol  will  be  mapped  into 
a  new  node.  This  new  node  can  appear  in  one  of  n  +  1  possible  locations  in  the  current 
tree.  Following  Knuth  [7]  we  will  refer  to  these  locations  as  vacancies  of  the  current 
tree.  They  are  depicted  in  Figure  4. 1  as  empty  squares. 

Let  for  n  >  \,j>2,  VnJ  be  the  number  of  vacancies  at  level  j  at  time  n,  i.e.,  in  a 
random  binary  tree  with  n  nodes.  Clearly  Vnj  =  0  fory  >  n  +  1  and 

K.2+V„,3+  ■••  +  K,n+  1  —  n  +  1.  (4.1) 

Also,  obviously  Vl2  =  2.  Suppose  now  that  at  time  n  the  vacancies  have  values 
v„2<  •  ■  ■ ,  vn„  +  \ ■  Since  a  new  node  is  equally  likely  to  fill  any  of  the  current  n  +  1 
vacancies,  the  probability  that  it  will  fill  a  vacancy  at  level  j  equals  vn  //(n  +  1).  But 
then 


K+lj  =  Vnj~  !> 

Vn+\j+\  =  %+i  -*-2,  while 

K+ 1,  =  fora11  i*j+  1- 

Thus  if  we  define  a  sequence  of  random  polynomials 


K(x)  =  £  Vn4x>-\  n>  1, 
j>  2 

we  see  that  these  polynomials  form  a  Markov  chain  with  transition  probabilities 
P{Wn+\(*)  =  w„(x)  +  xJ-\2x  -  1)|  Wn{x)  =  wa(x))  =  , 

n  >  1,  j  =  2, ...,«  +  1,  and  the  initial  state  W^(x)  =  2x. 


In  particular  we  then  have 

»;(*)}-  K(*)+ 


n  +  2 

\ 

4 

j' 


C*J 


-  W*(x) 


1  + 


2x  -  1 
n  +  1 


n  >  1. 


Figure  4.1 
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Consequently 

*{»n(*)}  =  ('  +  “)(>  +  Tff) ' ' '  ('  + 


fl+1  f 


-  ±2x(2x  +  1)  •  -  (2x+  »-  1)=  Jy  2  [jl  l  («) 

where  ["  ]  are  Stirling  numbers  of  the  first  kind  in  Knuth's  definition  and  notation  [7]. 
Thus 


2J~ 


j- 1 


j  =  2,  .  .  .  ,  n  +  1,  «  >  1, 


(4.3) 


gives  us  the  expectation  of  the  number  of  vacancies  at  various  levels  (see  Appendix. 
Table  A.5).  Other  quantities  of  some  interest  may  be  obtained  from  here.  For  instance 
the  expected  distribution  of  the  number  of  vacancies  over  levels  is  immediate 


vn(j)  =  E  {  K.j  } 


2j~ 1  r  n 

F+’oru-'r 


j  =  2,  . .  . ,  n  +  1 ;  n  >  1 .  (4.4) 


Next  let  K„  be  the  number  of  comparisons  needed  to  insert  a  new  (n  +  l)st  symbol 
into  a  binary  tree  by  the  algorithm  described  in  the  introduction.  Then  the  probability 


p(k,-k)-e{P(k.-k  |r„(*))} 


I  ^ n-K  *  1 

[  n  +  1 


-  ’-(«  +  0. 


k  =  1, .  . . ,  n,  n  >  1, 


since  the  numbers  of  comparisons  is  k  if  and  only  if  the  symbol  fills  a  vacancy  at  level 
k  +  1.  Recalling  that  the  Stirling  number  ["]  is  also  the  number  of  permutations  of  n 
symbols  with  exactly  k  cycles  we  can  write 


p(Kn  =  K)  =  7+T9^K)’ 

where  q„{K)  is  the  probability  that  a  random  permutation  of  n  symbols  has  k  cycles. 
(One  wonders  whether  the  number  of  comparisons  can  be  related  to  the  number  of 
cycles.)  (See  Appendix,  Table  A.7.) 

With 

the  generating  function  (by  definiton  of  the  Stirling  number)  is 

g„o) = i  p(k = «)*•-  n  V 1  • 

K=l  -C=l  K  +  1 

Hence  kn  is  a  convolution,  k„  —  +  •  •  •  +  x„,  where  the  are  independent 

Bernoulli  variates: 


x, 
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Consequently 


».-£(*.) +  }  +  +  ^h-) 


with  Hn  representing  the  Harmonic  number  [7],  (See  also  the  Riemann  Zeta  function 
in  Abramowitz  and  Stegun  [1].) 


°,2  =  Var(*„)  -  2  M„=  2  2  TXT  -  4  2 


1 


C=1  K-l  K  +  ^  K-l  (K  +  l)2 

=  2 <Jf„,  -  1)  -  4 (»«>,  -  1)  »  2H„t,  -  4 H<;’,  +  2. 

Since  asymptotically  [6] 

«,  =  r  +  l„n  +  ^  +  0(X),  + 

with  y  Euler’s  Constant,  0.5772  . .  . ,  we  obtain  for  the  number  of  comparisons 
fi„~2{y  —  1  +  ln(n  +  1)}  =  21n(n  +  1)  —  0.8456  .  . . , 

<j2~2  j  y  +  1  -  ^  +  ln(«  +  1)  j  =  21n(n  +  1)  -  3.4253  .... 

Since  |*J  <  1  and  <r„  ->  oc,  the  central  limit  theorem  applies,  yielding 

Also,  by  the  Berry'-Esseen  Theorem  (cf.  Feller  [5,  p.  544]) 


sup 

X 


n 


<KX) 


<4. 


where 


rn  =  2  e\*k~p*?  =  2  pMpZ  +  ql\ 

K-l  K-I 

Asymptotically, 

-  1)  +  4 («„<?,  -  1)  -  3(W„<J,  -  1)  +  1 

+  4(0.20205  .  .  .  )  —  ~  +  y  4-  ln(«  +  1)  +  1 

=  ln(n  +  1)  -  4.714  .  . . , 
so  that  the  bound 

6^~o(  *  ' 


m 

45 


■\Jln(n+"l) 


which  gives  a  fairly  high  rate  of  convergence  to  normality  for  the  distribution  of  the 
number  of  comparisons  to  insert  a  new  symbol.  The  results  given  can  be  used  to 
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analyze  a  complete  binary  insertion  sort  algorithm  operating  on  a  random  string  of 
symbols.  Higher  order  expansions  are  also  possible  for  describing  this  process. 

Further,  let  Mn]  be  the  number  of  nodes  at  level  j  in  a  random  binary  tree  with  n 
nodes.  Since  clearly 


,  +  V,J+i  =  2Mnj ,  j  =  1 . n,  with 

A/„  i  =  1,  M„  „  +  j  =  0,  n  >  1,  whence 

n 


Mn}  ~  j  2  (  2  )  i>  or  uPon  taking  the  expectation 

y-i. 


(4.5) 


=j 

■l  " 


,  n,  n  >  1. 


(See  Appendix,  Table  A.4.) 
Note  that 


-As 


'•  =  / 


is  an  expected  fraction  of  nodes  occupied  at  level  j.  It  may  be  called  an  expected 
relative  thickness  of  the  tree  at  level  j.  Looking  again  at  Figure  4.1  we  see  that  we  can 
distinguish  between  two  kinds  of  vacancies,  those  which  are  attached  to  a  leaf  (and 
hence  come  in  pairs)  and  those  attached  to  an  internal  node  of  the  tree.  We  will  refer 
to  these  two  kinds  as  twin  and  single  vacancies  respectively. 

Let  for  n  >  \,j  >  2,  be  the  number  of  single  vacancies  and  V$  the  number  of 
twin  vacancies  at  level  j  in  a  random  binary  tree  with  n  nodes.  Clearly 

Ki '  +  Kf  =  K/.  riS = o.  rfj  =  2. 

If  at  time  n  the  values  of  these  variables  are 

,.<>)  f.nA  ,.(2)  ,.(2) 

*'/i, 2’  '  *  ■  ’  ttn,n  +  I  •  •  •  > 

then  filling  a  single  vacancy  at  level  j  results  in 

yO)  _  ,,(!)_]  ]/(2)  _  f.(2)  .  j 

yn+\,j  Cy  *>  '  n+  1./  + 1  l‘n.j  ' 

with  probability  v'n'J/(n  +  1),  while  filling  a  twin  vacancy  at  level  j  results  in 
r/<  u  =  1/(2)  _  (2)  _  2  y( 2)  —  ,,(2)  4.2 

yn+l./  V/  T  1,  yn  +  l,j  VnJ  *">  *n+l./  +  l  t'«,/+  I  ^  ** 

with  probability  t +  1).  The  remaining  vacancy  numbers  are  not  changed.  If  we 
again  introduce  the  random  polynomials 

K n  =  2  ^t2)  =  2 

j>2  j> 2 

we  obtain  for  their  expected  values  the  equations 

£(  -  ~-tE{ »t)V)1  +  -tt£{  K2\x)\. 


nT7  1 


n  +  1  ( 

n  -  l  +  2x 


„  +  l  «C*2V)). 

n  >  I,  with  E\Wl'\x)}=  0,  f  Wl2,(x)}  -  2x. 
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Since 

E{Wl'\x))  + E{W?\x)}  =  E{W„(x)l 
we  obtain  by  substitution 


E  ( wi'U*)}  -  Sri £  { "'■"<*>}  +  e  ! w ■("))• 

from  which  by  using  (4.2)  we  have 


n  —  1  .  r+l r 

= _ 1 _ y _ ! _ y 

n(n-\)  r%  (r-  1)!  _ 


(2X)J- 


j-  t 


n  n  —  I 

1  2  1 


n(n  +  1)  jil  ,-y-l  (r~  1)! 


j~  1 


Thus, 


£  f  K<"  )  =  2‘  '  V  1  '  r 

1  '  »(»+!), -7- i  ('■-1)1  D"  1  J’ 


n  >  1.  (4.6) 


(4.7) 


y  =  2, n  >  1,  while  the  corresponding  expression  for  twin  vacancies  is  obtained 
by  subtracting  (4.7)  from  (4.3)  with  (4.7)  set  equal  to  zero  fory  =  n  +  1  (see  Appendix, 
Table  A.6). 

We  conclude  this  section  by  computing  the  expected  number  of  leaves  in  a  random 
binary  tree  with  n  nodes.  From  the  first  equality  in  (4.6)  the  expected  number  of  all 
single  vacancies  equals 


£{^!)(1)} 


1 

n(n~\) 


n~  1 


2  '£{  KW) 


i 

„(„-!) 


n  —  1 


2 


since  by  (4.2)  E  { W^n(l)}  =  n  +  1.  The  latter  sum  equals  ^(n  +  1)  so  that  the  expected 
number  of  all  twin  vacancies  is  n  +  1  -  ^(/j  +  1).  The  expected  number  of  leaves  is 
clearly  half  the  number  of  twin  vacancies,  that  is  j(n  +  1).  Thus,  in  a  random  binary 
tree  on  the  average  about  ^  of  the  nodes  are  leaves. 

Remark.  Having  in  mind  the  process  of  growing  random  binary  trees  as  described 
above  we  can  also  look  at  a  random  process 

Nk  =  min(n  :  Hn  >  k),  N0  =  1,  (4.8) 

i.e.,  the  time  ( =  number  of  symbols)  needed  to  grow  a  tree  over  the  height  k  (see 
Appendix,  Table  A.8).  From  (4.8)  clearly  P(Nk  >  n)  =  P(H„  <  k)  =  F(n,k)  so  that 
g*(x)  =  1  -  (1  -  x)fk(x)  is  the  ordinary  probability  generating  function  gk(x)  = 
'£n>\P(Nk  =  n)xn  of  the  random  variable  Nk.  Denoting 


Hm'k)  "  EiNk(Nk  -  1)  •  *  ’  (Nk  ~  m  +  1)} 
the  mth  factorial  moment  of  Nk  and  using  the  fact  that 

^m'k)=~dPi 
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we  obtain  by  applying  Leibnitz  formula  to  (3.13)  the  relation 

ji(m  +  2,k+  1)-  S(^12)f‘0'+  !.*)• 

w  >  0,  k  >  0.  In  particular  with  m  =  0  this  becomes 

*>0.  or 

VarfJV,,,}  -2(£{JVt })!+  £(iVlil}[l  -  £{«,*,}]. 

Unfortunately,  it  is  the  first  moment  E  {Nk},  which  is  hard  to  obtain  for  large  k  (see 
Appendix,  Table  A.7). 

Since  completion  of  this  work,  some  related  material  has  been  published  by  Ruskey 

[11]. 
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Appendix 


TABLE  A.l 


Binary  Trees  with  n  Nodes  and  Height  k%  t(n,  k)  (2.3) 


No.  of 
Symbols 

n\ 

0 

1 

2 

3 

4 

Height 

5  6 

7 

8 

9 

10 

0 

1 

1 

1 

1 

0 

1 

2 

2 

0 

0 

2 

3 

5 

0 

0 

1 

4 

4 

14 

0 

0 

0 

6 

8 

5 

42 

0 

0 

0 

6 

20 

16 

6 

132 

0 

0 

0 

4 

40 

56 

32 

7 

429 

0 

0 

0 

1 

68 

152 

144 

64 

8 

1430 

0 

0 

0 

0 

94 

376 

480 

352 

128 

9 

4862 

0 

0 

0 

0 

114 

844 

1440 

1376 

832 

256 

10 

16796 

0 

0 

0 

0 

116 

1744 

4056 

4736 

3712 

1920 

512 

TABLE  A.2 


Partitions  of  Permutations  of  n  Symbols  into  Binary  Trees  of  Height  k  ( Adapted  from  (3.9)) 


No.  of 
Symbols 

n\ 

0 

1 

2 

3 

4 

Height 

5 

6 

7 

8 

9 

10 

0 

1 

1 

1 

1 

0 

1 

2 

2 

0 

0 

2 

3 

6 

0 

0 

2 

4 

4 

24 

0 

0 

0 

16 

8 

5 

120 

0 

0 

0 

40 

64 

16 

6 

720 

0 

0 

0 

80 

400 

208 

32 

7 

5040 

0 

0 

0 

80 

2240 

2048- 

608 

64 

8 

40320 

0 

0 

0 

0 

11360 

18816 

8352 

1664 

128 

9 

362880 

0 

0 

0 

0 

55040 

168768 

104448 

30016 

4352 

256 

10 

3628800 

0 

0 

0 

0 

253440 

1508032  1277568 

479040 

99200 

11008 

512 
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HEIGHT 

Figure  A.  I .  Normalized  asymptotic  distribution  of  binary  tree  height  (2.42). 


TABLE  A.3 


Random  Binary  Tree  Height  ( Adapted  from  (3.9)  and  (3.23)) 


No.  of 
Symbols 

Minimum 

Height 

Mean 

Variance 

0 

0 

0.000 

0.000 

1 

1 

1.000 

0.000 

2 

2 

2.000 

0.000 

3 

2 

2.667 

0.222 

4 

3 

3.333 

0.222 

5 

3 

3.800 

0.426 

6 

3 

4.267 

0.507 

7 

3 

4.670 

0.570 

8 

4 

5.018 

0.682 

9 

4 

5.340 

0.774 

10 

4 

5.641 

0.837 

50 

6 

10.810 

2.051 

100 

7 

13.286 

2.522 

150 

8 

14.778 

2.772 

200 

8 

15.852 

2.934 

250 

8 

16.693 

3.052 

300 

9 

17.385 

3.147 

f1 1.443  In  (n) 

-3.259  ln(n) 

’f  indicates  next  higher  integer. 
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TABLE  A.4 


Expected  S' odes  by  Level  in  Random  Binary  Trees  (4.5) 


Level 

No.  of 
Symbols 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

0 

0.0000 

1 

1.0000 

2 

1.0000 

1.0000 

3 

1.0000 

1.3333 

0.6667 

4 

1.0000 

1.5000 

1.1667 

0.3333 

5 

1.0000 

1.6000 

1.5333 

0.7333 

0.1333 

6 

1.0000 

1.6667 

1.8111 

1.1222 

0.3556 

0.0444 

7 

1.0000 

1.7143 

2.0286 

1.4794 

0.6254 

0.1397 

0.0127 

8 

1.0000 

1.7500 

2.2036 

1.8016 

0.9171 

0.2786 

0.0460 

0.0032 

9 

1.0000 

1.7778 

2.3476 

2.0911 

1.2155 

0.4514 

0.1028 

0.0131 

0.0007 

10 

1.0000 

1.8000 

2.4684 

2.3515 

1.5122 

0.6494 

0.1828 

0.0323 

0.0032 

0.0001 

TABLE  A. 5 


Expected  Vacancies  by  Level  in  Random  Binary  Trees  (4.3) 


No.  of 
Symbols 

Total 

Vacancies 

1 

2 

3 

Level 

4  5 

6 

7 

8 

9 

10 

11 

0 

1 

1.0000 

1 

2 

0.0000 

2.0000 

2 

3 

0.0000 

1.0000 

2.0000 

3 

4 

0.0000 

0.6667 

2.0000 

1.3333 

4 

5 

0.0000 

0.5000 

1.8333 

2.0000 

0.6667 

5 

6 

0.0000 

0.4000 

1.6667 

2.3333 

1.3333 

0.2667 

6 

7 

0.0000 

0.3333 

1.5222 

2.5000 

1.8889 

0.6667 

0.0889 

7 

8 

0.0000 

0.2857 

1.4000 

2.5778 

2.3333 

1.1  I 1 1 

0.2667 

0.0254 

8 

9 

0.0000 

0.2500 

1.2964 

2.6056 

2.6861 

1.5556 

0.5111 

0.0889 

0.0063 

9 

10 

0.0000 

0.2222 

1.2079 

2.6041 

2.9667 

1.9796 

0.8000 

0.1926 

0.0254 

0.0014 

10 

11 

0.0000 

0.2000 

1.1316 

2.5853 

3.1908 

2.3750 

1.1159 

0.3333 

0.0614 

0.0063 

0.0003 

TABLE  A.6 


Expected  Single  Vacancies  by  Level  in  Random  Binary  Trees  (4.7) 


Level 

No.  of 
Symbols 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

0 

1 

l.oooo 

0.0000 

2 

0.0000 

0.1667 

3 

0.0000 

0.1111 

0.1111 

4 

0.0000 

0.0750 

0.1167 

0.1000 

5 

0.0000 

0.0533 

0.1022 

0.1467 

0.1067 

6 

0.0000 

0.0397 

0.0862 

0.1603 

0.2032 

0.1270 

7 

0.0000 

0.0306 

0.0724 

0.1585 

0.2680 

0.2993 

0.1633 

8 

0.0000 

0.0243 

0.0612 

0.1501 

0.3057 

0.4643 

0.4603 

0.2222 

9 

0.0000 

0.0198 

0.0522 

0.1394 

0.3241 

0.6019 

0.8226 

0.7309 

0.3160 

10 

0.0000 

0.0164 

0.0449 

0.1283 

0.3299 

0.7084 

1.1967 

1.4804 

1.1895 

0.4655 
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TABLE  A. 7 


Distribution  of  Comparisons  for  Insertion  of  Random  Symbol  in  Random  Binary  Tree 
((4.4).  Expected  Fraction  of  Vacancies  by  Level  in  Random  Binary  Trees  h ilh  k  =  Level  -  1) 


Comparisons,  k 

No.  of 
Symbols 

0 

1 

2 

3 

4 

s 

6 

7 

8 

9 

10 

0 

1 

1.0000 

0.0000 

1.0000 

T 

0.0000 

0.3333 

0.6667 

3 

0.0000 

0.1667 

0.5000 

0.3333 

4 

0.0000 

0.1000 

0.3667 

0.4000 

0.1333 

5 

0.0000 

0.0667 

0.2778 

0.3889 

0.2222 

0.0444 

6 

0.0000 

0.0476 

0.2175 

0.3571 

0.2698 

0.0952 

0.0127 

7 

0.0000 

0.0357 

0.1750 

0.3222 

0.2917 

0.1389 

0.0333 

0.0032 

8 

0.0000 

0.0278 

0.1440 

0.2895 

0.2985 

0.1728 

0.0568 

0.0099 

0.0007 

9 

0.0000 

0.0222 

0.1208 

0.2604 

0.2967 

0.1980 

0.0800 

0.0193 

0.0025 

0.0001 

10 

0.0000 

0.0182 

0.1029 

0.2350 

0.2901 

0.2159 

0.1014 

0.0303 

0.0056 

0.0006 

0.0000 

TABLE  A.8 


Pi  umber  of  Random  Symbols  Seeded  to  Grow  a  Binary  Tree 
of  Height  Exceeding  k  (4.8) 


Height 

k 

E{Sk) 

Number  of  Symbols 

o{Sk) 

Range 

0 

1 

0 

1 

1 

2 

0 

2 

2 

3.33 

0.491 

3-4 

3 

5.13 

0.995 

4-8 

4 

7.53 

1.861 

5-16 

5 

10.75 

2.931 

6-32 

6 

15.02 

4.533 

7-64 

7 

20.67 

6.680 

8-128 

8 

28.13 

9.557 

9-256 

9 

37.92 

13.512 

10-512 
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